THE FILES IN THIS ARCHIVE CAN BE USED TO REPLICATE ALL FIGURES, TABLES, AND RESULTS MENTIONED IN-TEXT IN THE MAIN PAPER AND SUPPLEMENTAL INFORMATION FOR ``THE PRESIDENT WILL SEE WHOM NOW?  PRESIDENTIAL ENGAGEMENT WITH ORGANIZED INTERESTS."

THIS ARCHIVE AT THE TOP LEVEL CONTAINS THREE DOCUMENTS AND THREE SUBFOLDERS.  THE CONTENTS OF THE THREE DOCUMENTS ARE:

APSR_Final_SI.pdf--A COPY OF THE SUPPLEMENTAL INFORMATION FOR THE ARTICLE (ALSO AVAILABLE AT APSR'S WEBSITE)

IRB-Approval-Memo.rtf--IRB APPROVAL FORM FOR LOBBYIST SURVEY ISSUED BY WASHINGTON UNIVERSITY IN ST. LOUIS

survey_question_wording.txt--FULL QUESTION AND RESPONSE OPTION WORDING FOR ALL SURVEY QUESTIONS MENTIONED IN THE MAIN TEXT AND SUPPLEMENTAL INFORMATION

EACH SUBFOLDER CONTAINS A DIFFERENT TYPE OF DATA (AND ACCOMPANYING CODEBOOKS AND CODE TO REPLICATE RESULTS/FIGURES).  I DISCUSS THE CONTENTS OF EACH SUBFOLDER AND ANY SPECIAL INSTRUCTIONS FOR UTILIZING THE FILES CONTAINED THEREIN SEPARATELY BELOW.

SYSTEM SPECIFICATION NOTES

THIS REPLICATION ARCHIVE WAS CONSTRUCTED ON A SYSTEM WITH THE FOLLOWING SPECIFICATIONS (BUT NOTE INFORMATION FOR visitor_logs_analyses SUBFOLDER):

OPERATING SYSTEM--WINDOWS 10 X64 
R VERSION--4.0.2
STAN VERSION--2.21.0
RELEVANT R PACKAGES/VERSIONS
--brms--2.14.5
--data.table--1.13.0
--lubridate--1.7.9
--Rcpp--1.0.7
--stringr--1.4.0
--survey--4.0
--texreg--1.37.5
--weights--1.0.4

#########################################################################################################

survey_data

THIS SUBFOLDER CONTAINS MATERIALS TO REPLICATE ALL ANALYSES WHICH UTILIZE DATA FROM THE AUTHOR'S SURVEY OF FEDERAL LOBBYISTS.

THE CONTENT AND/OR PURPOSE OF EACH FILE IS DESCRIBED BELOW:

--lobbyist_survey_data.csv--THIS DATA FILE CONTAINS DESCRIPTIVE STATISTICS ON PERSONS IN THE ORIGINAL SAMPLING FRAME AND RESPONDENTS, AS WELL AS THE RESPONDENTS' ANSWERS TO QUESTIONS UTILIZED IN THE MAIN PAPER AND SUPPLEMENTAL INFORMATION.

--survey_data_analysis.R--THIS R SCRIPT REPLICATES ALL FIGURES, TABLES, AND RESULTS MENTIONED IN-TEXT IN THE MAIN PAPER AND SUPPLEMENTAL INFORMATION WHICH RELATE TO THIS SURVEY DATA.  COMMENTED LINES IN THE R SCRIPT PROVIDE INFORMATION ABOUT WHICH PARTS OF THE SCRIPT REPLICATE WHICH PARTS OF THE MAIN PAPER/SUPPLEMENTAL INFORMATION.

--survey_data_codebook.txt--THIS TXT FILE PROVIDES INFORMATION ABOUT EACH VARIABLE CONTAINED IN lobbyist_survey_data.csv.

#########################################################################################################

visitor_logs_descriptives

THIS SUBFOLDER CONTAINS MATERIALS TO REPLICATE ALL DESCRIPTIVE STATISTICS PRESENTED IN THE SUPPLEMENTAL INFORMATION CONCERNING THE WHITE HOUSE VISITOR LOGS THEMSELVES AND MATCHES MADE BETWEEN VISITORS AND FEDERAL LOBBYISTS.

THE CONTENT AND/OR PURPOSE OF EACH FILE IS DESCRIBED BELOW:

--biden_logs_2021.csv--THIS DATA FILE IS THE BIDEN WHITE HOUSE'S RELEASE OF VISITOR LOGS FOR 2021.  THIS DATA FILE WAS TAKEN DIRECTLY FROM THE BIDEN WHITE HOUSE'S WEBSITE.  WHILE THE PAPER DOES NOT UTILIZE THE BIDEN VISITOR LOGS, IT DOES MENTION A COMPARISON TO THE VOLUME OF VISITS MADE AT A SIMILAR POINT OF THE OBAMA PRESIDENCY IN FOOTNOTE SI.19.

--biden_logs_codebook.txt--THIS TXT FILE PROVIDES INFORMATION ABOUT EACH VARIABLE CONTAINED IN biden_logs_2021.csv.

--clinton_logs.csv--THIS DATA FILE CONTAINS INFORMATION ON VISITORS TO THE CLINTON WHITE HOUSE WHICH THE CLINTON LIBRARY IDENTIFIED AS OCCURRING IN 1998, 1999, OR 2000.  THESE LOGS WERE OBTAINED DIRECTLY FROM THE CLINTON LIBRARY (FOIA REQUESTS 2007-0779-F AND 2016-0727-F).  THIS DATA FILE CONTAINS ALL UNIQUE VISITS (I.E., ROWS) PROVIDED BY THE CLINTON LIBRARY.  COLUMNS 1 AND 2 ARE UNIQUE VISIT AND APPOINTMENT IDENTIFIERS, RESPECTIVELY.  COLUMNS 3-20 ARE DRAWN DIRECTLY FROM THOSE VISITOR LOGS WITHOUT MODIFICATION BY THE AUTHOR.  REMAINING COLUMNS CONTAIN INFORMATION ABOUT THE LOBBYIST MATCHED WITH EACH VISIT BY THE AUTHOR (WHERE A MATCH WAS MADE) AND THE VISITEE CORRESPONDING WITH THE VISIT.

--clinton_logs_codebook.txt--THIS TXT FILE PROVIDES INFORMATION ABOUT EACH VARIABLE CONTAINED IN clinton_logs.csv.

--obama_logs.csv--THIS DATA FILE CONTAINS INFORMATION ON VISITORS TO THE OBAMA WHITE HOUSE WHICH THE OBAMA ADMINISTRATION RELEASED DURING ITS TIME IN OFFICE.  THESE LOGS WERE OBTAINED FROM THE OBAMA WHITE HOUSE'S WEBSITE.  THIS DATA FILE CONTAINS ALL UNIQUE NON-CANCELLED VISITS (I.E., ROWS) PROVIDED BY THE OBAMA ADMINISTRATION.  COLUMNS 1 AND 2 ARE UNIQUE VISIT AND APPOINTMENT IDENTIFIERS, RESPECTIVELY.  COLUMNS 3-30 ARE DRAWN DIRECTLY FROM THOSE VISITOR LOGS WITHOUT MODIFICATION 
BY THE AUTHOR.  REMAINING COLUMNS CONTAIN INFORMATION ABOUT THE LOBBYIST MATCHED WITH EACH VISIT BY THE AUTHOR (WHERE A MATCH WAS MADE) AND THE VISITEE CORRESPONDING WITH THE VISIT.

--obama_logs_codebook.txt--THIS TXT FILE PROVIDES INFORMATION ABOUT EACH VARIABLE CONTAINED IN obama_logs.csv.

visitor_logs_descriptives.R--THIS R SCRIPT REPLICATES ALL TABLES AND RESULTS MENTIONED IN-TEXT IN THE SUPPLEMENTAL INFORMATION WHICH RELATE TO DESCRIPTIVE INFORMATION ABOUT THE VISITOR LOGS AND MATCHES MADE BETWEEN VISITORS AND FEDERAL LOBBYISTS.  COMMENTED LINES IN THE R SCRIPT PROVIDE INFORMATION ABOUT WHICH PARTS OF THE SCRIPT REPLICATE WHICH PARTS OF THE SUPPLEMENTAL INFORMATION.

#########################################################################################################

visitor_logs_analyses

THIS SUBFOLDER CONTAINS MATERIALS TO REPLICATE ALL EMPIRICAL ANALYSES PRESENTED IN THE MAIN PAPER AND SUPPLEMENTAL INFORMATION CONCERNING WHITE HOUSE ENGAGEMENT DURING THE CLINTON AND OBAMA PRESIDENCIES.

NOTE 1:  DUE TO THE SIZE OF THE UNDERLYING DATA FILES AND COMPLEXITY OF THE BAYESIAN MULTILEVEL MODELS, THE REPLICATION CODE UTILIZES BOTH BETWEEN- AND WITHIN-CHAIN PARALLELIZATION IN STAN.  UNFORTUNATELY, WITHIN-CHAIN PARALLELIZATION IS NOT SUPPORTED ON WINDOWS MACHINES.  CONSEQUENTLY, THE AUTHOR ESTIMATED ALL MODELS USING WINDOWS SUBSYSTEM FOR LINUX (WSL2) AND RETRIEVED FITTED MODEL OBJECTS USING RSTUDIO IN WINDOWS (WITH THE SYSTEM SPECIFICATIONS DESCRIBED AT THE TOP OF THIS README).  THE SPECIFICATIONS FOR WSL2 USED BY THE AUTHOR TO ESTIMATE THE MODELS ARE AS FOLLOWS:

LINUX DISTRIBUTION--Ubuntu 20.04.1
R VERSION--4.0.3
STAN VERSION--2.21.0
RELEVANT R PACKAGES/VERSIONS
--brms--2.14.0
--data.table--1.13.2
--Rcpp_1.0.5

ALL MODELS ARE ESTIMATED WITH 4 CHAINS AND 3 CORES PER CHAIN, FOR A TOTAL OF 12 CORES.  IF THIS PARALLELIZATION SCHEME IS ADJUSTED, RESULTS MAY NOT REPRODUCE EXACTLY.

NOTE 2:  UTILIZING STAN IN R (DONE IN THIS REPLICATION ARCHIVE USING THE R PACKAGE brms) REQUIRES SOME USER EFFORT TO INSTALL/CONFIGURE ONE'S COMPUTER.  PLEASE CONSULT INFORMATION FROM THE STAN DEVELOPMENT TEAM'S WEBSITE (https://mc-stan.org/) FOR ASSISTANCE, IF NECESSARY.

NOTE 3:  BECAUSE MANY OF THESE MODELS ARE COMPUTATIONALLY INTENSIVE, THIS FOLDER CONTAINS BOTH THE DATA AND CODE NEEDED TO REPLICATE THE MODELS AND THE FITTED MODEL OBJECTS THEMSELVES SO THAT INTERESTED READERS CAN EXAMINE THEM WITHOUT NEEDING TO ESTIMATE THEM THEMSELVES.

clinton_logs_analysis.csv--THIS DATA FILE CONTAINS ALL INFORMATION NEEDED TO REPLICATE ANALYSES CONCERNING THE CLINTON WHITE HOUSE'S ENGAGEMENT WITH ORGANIZED INTERESTS.  EACH ROW CORRESPONDS TO AN ORGANIZED INTEREST-TIME PERIOD OBSERVATION.  ONLY ORGANIZED INTEREST-TIME PERIOD OBSERVATIONS THAT FILED LOBBYING DISCLOSURE ACT REPORTS IN BOTH THE PRESENT TIME PERIOD AND THE IMMEDIATELY  PRECEDING TIME PERIOD (E.G., IN BOTH THE FIRST AND SECOND SEMESTERS OF 1999) ARE INCLUDED IN THE DATA FILE.

clinton_logs_analysis_codebook.txt--THIS TXT FILE PROVIDES INFORMATION ABOUT EACH VARIABLE CONTAINED IN clinton_logs_analysis.csv.

extract_brmsfit.R--THIS R SCRIPT CONTAINS A CUSTOMIZED VERSION OF THE FUNCTION texreg USES TO CREATE TABLES FROM brmsfit CLASS OBJECTS; THIS IS USED TO MAKE ALL REGRESSION SUMMARY TABLES EXCEPT TABLE SI.11, FOR WHICH NO texreg FUNCTION CAN BE EASILY ADAPTED TO MAKE THE TABLE.

fig3.R--THIS R SCRIPT USES tableSI6_col1.RData TO RECREATE FIGURE 3, WHICH PRESENTS THE PREDICTED PROBABILITIES OF ENGAGEMENT DURING THE CLINTON ADMINISTRATION WHEN INTERESTS' RESOURCE LEVELS AND PARTISAN ALIGNMENT ARE SET TO SPECIFIED LEVELS.

fig4.R--THIS R SCRIPT USES tableSI6_col2.RData TO RECREATE FIGURE 4, WHICH PRESENTS THE PREDICTED PROBABILITIES OF ENGAGEMENT DURING THE OBAMA ADMINISTRATION WHEN INTERESTS' RESOURCE LEVELS AND PARTISAN ALIGNMENT ARE SET TO SPECIFIED LEVELS.

fig5.R--THIS R SCRIPT USES tableSI11_cols1and2.RData AND tableSI11_cols3and4.RData TO RECREATE FIGURE 5, WHICH PRESENTS THE DIFFERENCES IN THE COEFFICIENTS FOR LOBBYING EXPENDITURES, CAMPAIGN CONTRIBUTIONS, AND PARTISAN ALIGNMENT ON HIGH- AND LOW-QUALITY ENGAGEMENT IN THE CLINTON AND OBAMA ADMINISTRATIONS.

maintext_calculations.R--THIS R SCRIPT CONDUCTS CALCULATIONS FOR TWO IN-TEXT REFERENCES TO ANCILLARY CALCULATIONS WHICH SUPPORT MY EMPIRICAL ANALYSIS AND ITS INTERPRETATION.  FIRST, I PROVIDE THE NECESSARY CODE TO REPLICATE THE NUMBER AND PERCENTAGE OF INTERESTS IN MY DATA THAT HAVE CFSCORES OR IGSCORES AS STATED ON PAGE 19.  SECOND, I USE tableSI11_cols1and2.RData AND tableSI11_cols3and4.RData TO CALCULATE THE PREDICTED PROBABILITIES OF HIGH-QUALITY ENGAGEMENT DURING THE CLINTON AND OBAMA ADMINISTRATIONS WHEN INTERESTS' CAMPAIGN CONTRIBUTIONS ARE SET TO ZERO AND TO THE FIRST QUARTILE VALUE OF CONTRIBUTIONS, AS DESCRIBED ON PAGE 29 IN FOOTNOTE 36.

obama_logs_analysis.csv--THIS DATA FILE CONTAINS ALL INFORMATION NEEDED TO REPLICATE ANALYSES CONCERNING THE OBAMA WHITE HOUSE'S ENGAGEMENT WITH ORGANIZED INTERESTS.  EACH ROW CORRESPONDS TO AN ORGANIZED INTEREST-TIME PERIOD OBSERVATION.  ONLY ORGANIZED INTEREST-TIME PERIOD OBSERVATIONS THAT FILED LOBBYING DISCLOSURE ACT REPORTS IN BOTH THE PRESENT TIME PERIOD AND THE IMMEDIATELY  PRECEDING TIME PERIOD (E.G., IN BOTH THE FIRST AND SECOND QUARTERS OF 2010) ARE INCLUDED IN THE DATA FILE.

obama_logs_analysis_codebook.txt--THIS TXT FILE PROVIDES INFORMATION ABOUT EACH VARIABLE CONTAINED IN obama_logs_analysis.csv.

SItables_and_text.R--THIS R SCRIPT USES THE DATA FILES AND FITTED BAYESIAN MULTILEVEL MODELS TO CALCULATE ALL IN-TEXT QUANTITIES MENTIONED IN THE SUPPLEMENTAL INFORMATION AND TO RECREATE ALL RELEVANT TABLES IN THE SUPPLEMENTAL INFORMATION (TABLES SI.6-SI.11).  EACH TABLE, WHEN RECREATED, IS SAVED AS A NEW .tex FILE.

tableSI6_col1.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE FIRST COLUMN OF TABLE SI.6; THE RESULTING MODEL OBJECT PROVIDED AS tableSI6_col1.RData.

tableSI6_col2.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE SECOND COLUMN OF TABLE SI.6; THE RESULTING MODEL OBJECT PROVIDED AS tableSI6_col2.RData.

tableSI7_col1.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE FIRST COLUMN OF TABLE SI.7; THE RESULTING MODEL OBJECT PROVIDED AS tableSI7_col1.RData.

tableSI7_col2.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE SECOND COLUMN OF TABLE SI.7; THE RESULTING MODEL OBJECT PROVIDED AS tableSI7_col2.RData.

tableSI8_col1.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE FIRST COLUMN OF TABLE SI.8; THE RESULTING MODEL OBJECT PROVIDED AS tableSI8_col1.RData.

tableSI8_col2.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE SECOND COLUMN OF TABLE SI.8; THE RESULTING MODEL OBJECT PROVIDED AS tableSI8_col2.RData.

tableSI8_col3.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE THIRD COLUMN OF TABLE SI.8; THE RESULTING MODEL OBJECT PROVIDED AS tableSI8_col3.RData.

tableSI8_col4.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE FOURTH COLUMN OF TABLE SI.8; THE RESULTING MODEL OBJECT PROVIDED AS tableSI8_col4.RData.

tableSI8_col5.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE FIFTH COLUMN OF TABLE SI.8; THE RESULTING MODEL OBJECT PROVIDED AS tableSI8_col5.RData.

tableSI8_col6.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE SIXTH COLUMN OF TABLE SI.8; THE RESULTING MODEL OBJECT PROVIDED AS tableSI8_col6.RData.

tableSI9_col1.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE FIRST COLUMN OF TABLE SI.9; THE RESULTING MODEL OBJECT PROVIDED AS tableSI9_col1.RData.

tableSI9_col2.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE SECOND COLUMN OF TABLE SI.9; THE RESULTING MODEL OBJECT PROVIDED AS tableSI9_col2.RData.

tableSI10_col1.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE FIRST COLUMN OF TABLE SI.10; THE RESULTING MODEL OBJECT PROVIDED AS tableSI10_col1.RData.

tableSI10_col2.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE SECOND COLUMN OF TABLE SI.10; THE RESULTING MODEL OBJECT PROVIDED AS tableSI10_col2.RData.

tableSI10_col3.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE THIRD COLUMN OF TABLE SI.10; THE RESULTING MODEL OBJECT PROVIDED AS tableSI10_col3.RData.

tableSI10_col4.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE FOURTH COLUMN OF TABLE SI.10; THE RESULTING MODEL OBJECT PROVIDED AS tableSI10_col4.RData.

tableSI11_cols1and2.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE FIRST AND SECOND COLUMNS OF TABLE SI.11; THE RESULTING MODEL OBJECT PROVIDED AS tableSI11_cols1and2.RData.

tableSI11_cols3and4.R--THIS R SCRIPT REPLICATES THE MODEL PRESENTED IN THE THIRD AND FOURTH COLUMNS OF TABLE SI.11; THE RESULTING MODEL OBJECT PROVIDED AS tableSI11_cols3and4.RData.



